Outside the cave of shadows: using syntactic annotation to enhance authorship attribution
Internal identifier: 002766 (Main/Exploration); previous: 002765; next: 002767
Authors: H. Baayen [Netherlands]; H. Van Halteren [Netherlands]; F. Tweedie [United Kingdom]
Source:
- Literary and Linguistic Computing [0268-1145]; 1996-09.
Abstract
This paper reports an experiment in authorship attribution in which statistical measures and methods that have been widely applied to words and their frequencies of use are applied to rewrite rules as they appear in a syntactically annotated corpus. The outcome of this experiment suggests that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage. Complementary methods focusing on the high-frequency head and the low-frequency tail of the distribution independently reveal a higher resolution than traditional word-based analyses, and promise enhanced accuracy for authorship attribution.
DOI: 10.1093/llc/11.3.121
Links to previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 002796
- to stream Istex, to step Curation: 002607
- to stream Istex, to step Checkpoint: 001B77
- to stream Main, to step Merge: 002910
- to stream Main, to step Curation: 002766
The document in XML format
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Outside the cave of shadows: using syntactic annotation to enhance authorship attribution</title>
<author><name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
</author>
<author><name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
</author>
<author><name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE</idno>
<date when="1996" year="1996">1996</date>
<idno type="doi">10.1093/llc/11.3.121</idno>
<idno type="url">https://api.istex.fr/document/5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">002796</idno>
<idno type="wicri:Area/Istex/Curation">002607</idno>
<idno type="wicri:Area/Istex/Checkpoint">001B77</idno>
<idno type="wicri:doubleKey">0268-1145:1996:Baayen H:outside:the:cave</idno>
<idno type="wicri:Area/Main/Merge">002910</idno>
<idno type="wicri:Area/Main/Curation">002766</idno>
<idno type="wicri:Area/Main/Exploration">002766</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Outside the cave of shadows: using syntactic annotation to enhance authorship attribution</title>
<author><name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
<affiliation wicri:level="3"><country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Corresponding author at: Max Planck Institute for Psycholinguistics, Wundtlaan 1, 6525XD, Nijmegen</wicri:regionArea>
<placeName><settlement type="city">Nimègue</settlement>
<region type="province" nuts="2">Gueldre</region>
</placeName>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
<affiliation wicri:level="3"><country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Catholic University of Nijmegen, Nijmegen</wicri:regionArea>
<placeName><settlement type="city">Nimègue</settlement>
<region type="province" nuts="2">Gueldre</region>
</placeName>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
<affiliation wicri:level="1"><country xml:lang="fr">Royaume-Uni</country>
<wicri:regionArea>The University of the West of England, Bristol</wicri:regionArea>
<wicri:noRegion>Bristol</wicri:noRegion>
</affiliation>
<affiliation><wicri:noCountry code="syntax">???</wicri:noCountry>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Literary and Linguistic Computing</title>
<title level="j" type="abbrev">Lit Linguist Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint><publisher>Oxford University Press</publisher>
<date type="published" when="1996-09">1996-09</date>
<biblScope unit="volume">11</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="121">121</biblScope>
<biblScope unit="page" to="132">132</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE</idno>
<idno type="DOI">10.1093/llc/11.3.121</idno>
<idno type="local">2</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper reports an experiment in authorship attribution in which statistical measures and methods that have been widely applied to words and their frequencies of use are applied to rewrite rules as they appear in a syntactically annotated corpus. The outcome of this experiment suggests that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage. Complementary methods focusing on the high-frequency head and the low-frequency tail of the distribution independently reveal a higher resolution than traditional word-based analyses, and promise enhanced accuracy for authorship attribution.</div>
</front>
</TEI>
<affiliations><list><country><li>Pays-Bas</li>
<li>Royaume-Uni</li>
</country>
<region><li>Gueldre</li>
</region>
<settlement><li>Nimègue</li>
</settlement>
</list>
<tree><country name="Pays-Bas"><region name="Gueldre"><name sortKey="Baayen, H" sort="Baayen, H" uniqKey="Baayen H" first="H" last="Baayen">H. Baayen</name>
</region>
<name sortKey="Van Halteren, H" sort="Van Halteren, H" uniqKey="Van Halteren H" first="H" last="Van Halteren">H. Van Halteren</name>
</country>
<country name="Royaume-Uni"><noRegion><name sortKey="Tweedie, F" sort="Tweedie, F" uniqKey="Tweedie F" first="F" last="Tweedie">F. Tweedie</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002766 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002766 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:5F5DCCD19520C10BAEF5530E87D9BDF498D3E4AE |texte= Outside the cave of shadows: using syntactic annotation to enhance authorship attribution }}
This area was generated with Dilib version V0.6.32.